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MULTICASTING METHOD AND Comments (RFC) 1112 and 1458 which are reproduced at 

APPARATUS Appendices A and B of the Savetz book and in D. P. 

Brutaman et al., "MBONE provides Audio and Video Across 

This is a continuation, of application Ser. No. 09/435, the Internet," IEEE Computer, Vol. 27, No. 4, pp. 30-36 

732, filed Nov. 8, 1999, now U.S. Pat. No. 6,119,163 which 5 (April 1994), all of which are incorporated herein by refer- 

is a continuation of application Ser. No.09/1 10,369, filed Jul. ence. 

6, 1998 (now U.S. Pat. No. 5,983,005), which is a continu- Citation of the foregoing documents is not to be construed 

ation of application Ser. No. 08/644,072, filed May 9, 1996 as an admission that any of such documents is a prior art 

(now U.S. Pat. No. 5,778,187), and such applications are publication relative to the present invention, 
hereby incorporated by reference. io 



FIELD OF THE INVENTION 



SUMMARY OF THE INVENTION 



The present invention is a scalable architecture for deliv- 

This relates to a method and apparatus for providing audio ery of rea l-time information over a communications nct- 

and/or visual communication services, in real-time to a work Embedded into the architecture is a control mecha- 

multiplicity of identifiable users on a communications nism mat prov ides for the management and administration 

network, such as the Internet. In a preferred embodiment, the 0 f WD0 are t0 rece i V e the real-time information, 

invention monitors which users are receiving signals on ]n ^ ferred embodiment) lhe Momiliion being deliv- 

whichoneof a plurality of channels and modifies > the content ered fa M . audio Hq h ^ ^ ^ ^ 

of at least some signals in response thereto. A particular w ^ QT Q{h&T q{ information {hat can ^ 

application is to provide services akin to multi-channel radio f raDsmiUed ovef a di ^ tal ^ M TOs Mormation is 

or television with commercial programming content ^ * mmbcT q{ distribuled 

adjusted m accordance with the identity of the individual users u ^ real _ {ime in ^ for a give / channel of 

user information, approximately the same information is being 

BACKGROUND OF THE INVENTION 25 ^ a * approximately the same time to everyone who is 

enabled to receive the information. 

Systems such as the Internet typically are point-to-point nrui*u i.-i l. * t • c 

, J . . ,., Jr J . v ,5 Preferably, there are multiple channels of information 

(or umcast) systems in which a message k converted into a simultaneousl to te delivered t0 uselS) each 

series of addressed packets which are routed from a source . , ... r * j ^ * . c c ** 

. . . , c t . , channel consisting of an independent stream of information. 

node through a plurality of routers to a destination node. In „ A , _ . »JL.~ ■ ~ * t ~i ^i u..» 

*^ . r . J . . . . * - 30 A user chooses to tune m or tune out a particular channel, but 

most communication protocols the packet includes a header , , , #u #• * u-u.u u i -i* . -u . ■* 

, . , . , , , r . , « j does not choose the tune at which the channel distributes its 

which contains the addresses of the source and the destina- . f A , # # . x ■ e 

, , , . . .„ , information. Advantageously, interactive (two-way) lnfor- 

tion nodes as well as a sequence number which specifies the . . ° . , . J . v ... , J \ 

cket's order in the messa e mation can be incorporated into the system, multiple streams 

pac e s or er in e message. o ^ information can be integrated for delivery to a user, and 

In general, these systems do not have the capability of 35 certain portions of the information being delivered can be 

broadcasting a message from a source node to all the other tailored to the individual user, 
nodes in the network because such a capability is rarely of 

much use and could easily overload the network. However, BRIEF DESCRIPTION OF THE DRAWING 
there are situations where it is desirable for one node to 

communicate with some subset of all the nodes. For ^ These and other objects, features and advantages of our 

example, multi-party conferencing capability analogous to invention will be more readily apparent from the following 

that found in the public telephone system and broadcasting Detailed Description of a Preferred Embodiment of our 

to a limited number of nodes are of considerable interest to invention in which 

users of packet-switched networks. To satisfy such demands, FIG. 1 is a schematic diagram depicting an overview of 

packets destined for several recipients have been encapsu- 45 the system of the present invention; 

lated in a unicast packet and forwarded from a source to a FIG. 2 is a schematic diagram depicting the network 

point in a network where the packets have been replicated control center for the system of FIG. 1; 

and forwarded on to all desired recipients. This technique is F i G 3 ^ a schematic diagram depicting a unicast distri- 

known as IP Multicasting and the network over which such bulion structure; 

packets are routed is referred to as the Multicast Backbone c n vir a ^ « t - A - „ Mm , • „ m , ... . 

wn^KTP w .1 . « .| * * FIG. 4 is a schematic diagram depicting a multicast 

or MBONE. More recently, routers have become available . t 0 r & 

... 4 . ... ... / , .j v distribution structure; 

which can route the multicast addresses (class D addresses) 

provided for in communication protocols such as TCP/IP FIG - 5 15 a schematic diagram depicting the connection 

and UDP/IP. A multicast address is essentially an address for ^ etween the media server and the user in the s y stem of nG - 

a group of host computers who have indicated their desire to 55 

participate in that group. Thus, a multicast packet can be FIGS. 6-17 are timing diagrams which depict various 

routed from a source node through a plurality of multicast aspects of the operation of the system of FIG. 1; and 

routers (or mrouters) to one or more devices receiving the FIGS. 18 and 19 depict the user interface for control of the 

multicast packets. From there the packet is distributed to all system of FIG. 1. 

the host computers that are members of the multicast group, Where the same reference numerals appear in multiple 

These techniques have been used to provide on the drawings, the numerals refer to the same or corresponding 

Internet audio and video conferencing as well as radio-like structure in such drawings, 
broadcasting to groups of interested parties. See, for 

example, K. Savetz et al. MBONE Multicasting Tomorrow's DETAILED DESCRIPTION OF THE 

Internet (IDG Books Worldwide Inc., 1996). 65 PREFERRED EMBODIMENT 

Further details concerning technical aspects of multicast- Referring to FIG. 1, the system' of the present invention 

ing may be found in the Internet documents Request for comprises a Network Control Center 10, a plurality of 
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Primary Servers 20, Media Servers 30, Users 40 and Control switched network, such as the Internet. The control archi- 

Servers 50 and an Administration Server 60. The servers are tecture represents a second scalable system integrated with 

interconnected by a communications network, which in the the distribution architecture for managing and administering 

preferred embodiment is the global connected internetwork the delivery of that information. 

known as the Internet. The Network Control Center 10 is the 5 The remainder of this description is divided into three 

source of the information being distributed. It receives audio sections. In the next section the distribution architecture will 

feeds from satellite, over the air broadcast or in other ways Dc described in more detail. Following that, the control 

and processes this information for delivery over the network architecture will be described. In the third section the User 

on multiple channels of information. This processing con- interface will be illustrated. 

sists of optionally recording the information for nature to j. Distribution Architecture 

broadcast and dynamically inserting paid commercial adver- ™, 4 . _ .« £ ■ j i * c 

3 J &r The distribution architecture provides for the delivery of 

tisements ... j 

real-time information to any number of Users distributed 

For each channel of information, there is a Primary Server throughout a network. As will be described in detail below, 

20 that receives the stream of information from the Network the distribution architecture is scalable to allow for efficient 

Control Center 10 and compresses the information stream to « deiivery of muUiple simultaneous information channels in 

allow for more efficient transmission. The Primary Servers rea l-time to a large number of Users. 

20 are directly connected to the network. , n tfae preferred cmbodimcnt> the informati on which is 

The Primary Servers forward information via the network being distributed consists of high-quality audio in addition 

to a number of Media Servers 30. There may be a large t o other information. It should be appreciated that the basic 

number of Media Servers and in fact there may be many architecture and other general principles set forth herein 

levels of Media Servers. For example, a Media Server which wou id a is 0 apply to the delivery of video, graphics, text or 

receives a stream of information from a Primary Server may any omer type of information that can be delivered over a 

forward that stream via the network to another Media Server digital network. In addition, it should be appreciated that an 

35 which then forwards it to a User 40. This multilevel information stream can consist of audio with supplemental 

hierarchical structure is described in more detail below. information such as text and graphic images and commands 

The topology of the Internet dictates the ideal placement to control software running on the User's computer, 

of Media Servers, the fan-out of each Media Server and the The source of information in the preferred embodiment is 

number of levels of Media Servers between the Primary the Network Control Center 10, depicted in the schematic 

Server and Users. For example, the Media Servers which 3Q diagram of FIG. 2. Control Centers of this type of design are 

feed from a Primary Server might be placed at a major points available from Broadcast Electronics, Inc. and are similar to 

of presence (POPs) of each of the large Internet service what would be found in a conventional radio station serving 

providers. These Media Servers might also be placed near multiple frequencies. 

clouds which serve as high bandwidth exchange points Referring to FIG. 2, the incoming signal can be received 

between the major carriers. Similarly, Media Servers which 35 in a variety of ways such as from a satellite, over-lhe-air 

feed to Users might be placed on or close to networks which broadcast, cable or hard disk. It is then processed by 

have a large number of subscribers to minimize the distance Receiver/Decoder 110, which decodes the signal and pro- 

and number of data streams being transmitted. vides an i DCOming aud i 0 stream. Routing Switcher 120 is 

Control Servers 50 are responsible for keeping track of responsible for routing the incoming audio feed from the 

which Users are listening to which channels and for direct- ^ Receiver to either Delay Recording Workstation 140 or to 

ing the Media Servers to start and stop streams of inform a- one of the Playback/Control Workstations 130. Real-time 

tion to those Users. The Control Servers are also responsible insertion of paid commercial advertising takes place at the 

for handling other interactions among the various compo- Playback/Control Workstations and the resulting integrated 

nents of the system as will be described in more detail below. audio stream is delivered to the Primary Servers. The Delay 

Each Control Server is responsible for managing a cluster of 45 Recording Workstation is responsible for recording an 

Media Servers; and each Media Server is managed by a incoming broadcast so that it can be played back at a later 

single Control Server at any given time. As a result, the time. 

Control Servers are distributed throughout the Internet, Supervisory Workstation 150 is responsible for managing 

preferably located close to the Media Servers. and controlling the Playback/Control Workstations, Delay 

The Administration Server 60 is responsible for register- 50 Recording Workstations and other computers as may be 

ing new Users, authenticating Users who want to log onto connected to the local area network within the Network 

the system, and maintaining audit logs for bow many Users Control Center. Production Workstation 160 and 

are listening to which channels and at which times. Main- AudioVAULT-NFS Server 170 are used to manipulate audio 

taming audit logs and gathering statistics are features critical samples, such as commercial messages for use by the 

to monitoring the delivery of paid commercial messages as 55 Playback/Control Workstations. The audio being delivered 

well as for other purposes. For example, for purposes of can consist of syndicated TV or radio programs, such as 

assessing copyright royalties, the audit logs can record the would be received over satellite or cable and delivered as 

number of listeners for each musical or video selection that described above. These can be delivered live and/or played 

is distributed by the system. Another application is to back at a later time. It is also possible for the delivery of 

determine the percentage of listeners who are interested in 6 o information, such as music, to take place from information 

listening to a particular musical selection by determining that is all stored locally such as on a hard disk. A new play 

how many listen to the entire selection and how many turn list and its associated music data can then be downloaded 

it off* periodically to update the channel. Additionally, it is pos- 

The system of the present invention can be considered a sible to deliver commercial-free programming, for example 

distribution architecture integrated with a control architec- 65 public service announcements or label-specific music, 

hire. The distribution architecture handles scalable real-time In the preferred embodiment the Primary Servers are 

delivery of information to any number of Users on a packet responsible for compressing the audio stream using an 
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advanced perceptual technique developed and licensed by sequence of individual pieces of information, or packets. 

AT&T Corp. and Lucent Technologies, Inc. This highly . Thus the distribution architecture implements a form of 
sophisticated algorithm is used to maximize the benefit of multicast packet delivery to a group. The group in this case 

the bandwidth available. Advantageously, two bitrates are is the set of all Users who are listening to a given channel 

available, a first rate of approximately 20 Kbps and a second 5 al * given time. Group membership is dynamic, Users can 

rate of approximately 56 Kbps. Using the perceptual start "d st0 P herring to a channel at any time, 

technique, the quality of the first rate is similar to FM Multicasting can be implemented in a variety of ways, any 

monaural (with a sampling rate of approximately 22,000 or a11 of wnich can te vscd ^ e P re sent invention. In the 

16-bit samples per second) and the second rate is close to preferred embodiment, the Media Servers receive unicast 

CD quality stereo (with a sampling rate of approximately 10 P acket streams and thc y then duplicate these streams into 

32,000 16-bit samples in stereo each second). Ine signals at more unicast streams to other Media Servers which are m the 

the two different bitrates comprise two different audio chan- membership group for that stream. The lowest level Media 

nels and thus require two different compression processes. Servers use hardware broadcast multicast and/or unicast to 

_ 7 , _ . reach all Users served by that Media Server. 

Tta computational requirements of compressing an audio , f ^ Media d tQ , he ^ 

stream in real time using techniques such as the advanced 15 , . . . lL TT , * , , 

. , . • . , i/W v c r» *• physical net work as the User, hardware broadcast or mul ti- 

perceptual technique are approximately 100% or a Pentium- r \ , , , , • , n TT 

n ^/vk \mi- . j * *• i * cast can be used to transmit the packet stream to all Users 

Pro 200 Mhz computer and the computational requirements , A , . . , r . T ... w 

r , . r ,. • i listening at that time on that network. In this case the Media 

of decompressing an audio stream in real time are approxi- „ , , , . . i ■ * i_ j . 

. i Z\of r n nc \mu . i? . Servers can translate the incoming packets into broadcast or 

mately 30% of a Pentium 75 Mhz computer. Future ... . . - . r , , , , 

J % i i . , .J u -in multicast packets for transmission on the local network, 

improvements and/or changes to the algorithm could sig- 20 « , . r , t * ■ * -. • *i_ i i 

./ . , . & . * . & Only a smgle packet is transmitted at-a-time on the local 

nificantly change these requirements. For the present, a * ij . j . .u i i 

. # j * . . . ... n . c \ network and any computer directly connected to the local 

dedicated computer is required within the Primary Server to . • . * T r _. i.- . • i_ 

• * . A ^ ™ , . network can receive that packet. Hardware multicast is built 

compress the audio stream. The decompression process . , . , . r . . , . 4 . 

♦ t i , TT , , j fui u into most networks and it is lower in overall overhead than 

takes place on end Users computers and preferably would . , . j * * * , * ^ • 

, t . c . r f \ . . hardware broadcast since computers not interested m a 

use only a portion or the computers computational 25 . - • j . . i 

. : t ,, 4 , . . u I c *u transmission do not have to process the packets. In the case 

requirements, allowing the computers to be used for other . „ . ■ TT *_ • *i_ 

t i .M *i_ »i_ i ■ that a Media Server is serving a User who is not on the same 

tasks while they are processing the audio stream. , . , 4 , . & ... , 4 . . 4 

r & physical network, a unicast transmission is used to reach that 

It is important to appreciate that the compression and User, which requires a separate packet transmission for each 

decompression techniques employed by the present inven- User M connected. In the preferred embodiment, the assign- 

tion are not critical to the overall operation of the system and ment of Users t0 Media Servers fa done ^ 

the advantages obtained therefrom could be obtained with transactions among the User 40, Control Servers 50, and 

other compression methodologies. Advantageously, the Administration Server 60. This system will be described 

identity of the compression technique used can be encoded more Mly in me following section, 

into the audk) stream in the packet header. This makes it Multicasting can also be implemented within the Internet 

possible to identify to the receiver the nature of the decom- a( (he , p ^ ugi Ip ^ D ^ ^ , GMp 

pression algorithm to use; and thereby make it possible for group control protocol. FIG. 4 illustrates how the multilevel 

the computer within the Primary Server to select an opti- hierarchica i distribution architecture would operate using IP 

mum compression algorithm depending on the nature of the muUicast ddi Undef ^ &ys{ ^ a packet ^ transmi t ted 

audio stream to be compressed. ^ ^ a m}lllicilsX address for a destination md eacb router 

The remainder of the distribution architecture comprises maintains group membership lists for each interface that it is 

the multilevel hierarchy of data transmission originating at connected to and will forward packets across the Internet to 

the Primary Server 20 and terminating at the Users 40 as otner rou ters such that all Users within the global group 

shown in FIG. 3. In the preferred embodiment, the network eventually receive a copy of the packet. Unless and until all 

is the global connected Internet. It can also include private 45 routers within the Internet understand multicasting in this 

networks which are connected to the Internet and it could be waVf it is necessary to supplement it with IP tunneling in 

implemented on any packet switched network, cable- which multicast packets are encapsulated in unicast packets 

modem-based or satellite-based cable system. It is possible and rou tcd by unicast routers to a multicast routers. The 

that certain links within the overall system, for example, the present invention can and will be able to take advantage of 

link between the Primary Server and the first level of Media 5Q !P multicasting as it becomes widely available. Each channel 

Servers, are private data links which carry only data asso- c f information would be given its own class D address and 

ciated with this system. This could also be true of other data t he Media Server would then simply transmit packets using 

transmission paths in the distribution architecture. The User the appropriate IP destination address. In this case no Media 

receiving the information preferably can be anyone who has Servers would be used as this function would be accom- 

access to the Internet with sufficient bandwidth to receive the 55 phshed by the routers in use to store and forward other IP 

resulting audio data. packets. 

It should be appreciated that the distribution architecture Thus it can be appreciated that the implementation of the 

of the present invention provides for scalability. Using such multicast delivery structure can be implemented using a 

a structure, any number of Users, and as widely distributed combination of IP unicast, IP multicast and hardware mul- 

as necessary, can be accommodated. In the preferred 60 ticast or any other system which provides for distributed 

embodiment, the fan-out at each level of Media Server deliveryofinformationtoaspecificgroupofdestinations.lt 

(given the state of technology today) is on the order of ten, is expected that special relationships with Internet providers 

but the same structure could be applied with other fan-outs. will be established so that delivery of the audio steams can 

The location and fan -out of the Media Servers is chosen to take place with a guaranteed bandwidth and in the most 

minimize overall network bandwidth consumed. 65 efficient way possible. 

The flow of information from Primary Server 20 through In the preferred embodiment, packets of information for 

network to User 40 is based on the delivery of a continuous distribution use the UDP protocol under IP rather than the 
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TCP protocol. TCP provides for reliable stream delivery but 
at the cost of retransmission and delays. For real-time 
information, it is usually more appropriate to use UDP since 
the information is time critical and low latency is more 
important that reliability. Since TCP is a point-to-point 
protocol, it is incompatible with IP multicasting. However, 
TCP could be used on the IP unicast links between Media 
Servers which are expected to have very low packet loss. In 
order to handle out of order, lost, duplicate and corrupted 
packets, the UDP packets are serialized. 

In the preferred embodiment the size of the audio packets 
being transmitted is variable and can change on a packet by 
packet basis. It is expected that when using compression 
schemes that have a fixed bit rate, such as ADPCM, all 
packets for that stream would be the same size. Alternatively 
when using a variable bit rate compression algorithm, it is 
expected that packet size would vary so as to establish 
approximately the same amount of time for each sample. For 
example, if each packet corresponds to a 20 millisecond 
segment of speech, this could correspond to 100 bytes 
during one time period and 200 bytes during another. 
Additionally, the Media Server may choose to dynamically 
vary the packet size to accommodate changes in network 
conditions. 

Since the resulting playback of audio information is 
sensitive to packet loss and network congestion, software 
running on the various computers which make up this 
system monitor the ongoing situation and adapt to it in the 
best possible way. This may involve using different Media 
Servers and/or lowering the data rate to the User. For 
example, similar to analog dynamic signal quality negotia- 
tion present in many analog radio receivers, the User soft- 
ware may request a lower bitrate until the situation is 
improved. Also, note that the audio information being deliv- 
ered to the User is preferably interleaved so that a contigu- 
ous segment of the audiostream is distributed for transmis- 
sion over several packets. As a result, the loss of one packet 
is spread out over multiple audio samples and causes mini- 
mal degradation in audio. Advantageously, a small degree of 
redundancy may be incorporated within the audio stream to 
further guard against packet loss. 

Preferably, there are two bitrate options available to the 
User for audio delivery. These are approximately 20 Kbps 
for standard audio and approximately 56 Kbps for high 
quality audio. Thus, a 28.8 Kbps modem connection over an 
analog phone line is sufficient to listen to standard audio 
broadcasts. To listen to high quality audio, an ISDN con- 
nection to the Internet is required, or some other connection 
with greater than 56 Kbps bandwidth. It should be appreci- 
ated that higher band widths are currently becoming avail- 
able to end Users. In particular the use of cable modems and 
residential fiber networks are enhancing the bandwidlhs 
available to Users and thus making broadcasts of higher 
bitrates more practical. 

In addition to the content of the audio channel being 
delivered, it is also possible to deliver out of band of side-bar 
information such as graphics, images and text. This side-bar 
information is synchronized with the audio channel. This 
may only involve small increases in bandwidth 
requirements, such as 1-2 Kbps. For example a music 
program could deliver images of an album cover, the text of 
song lyrics, or URLs for use by a Web browser. The User can 
preferably choose to have the side-bar information show up 
automatically or be hidden. It is also possible to incorporate 
two-way interaction into the system, such that for example 
Users can participate in a global chat session during the 
audio broadcast. These and other details are explained in 
more detail below under the description of the User inter- 
face. 

The delivery of paid commercial advertising information 
is an important aspect of the present invention. Advertising 



may be incorporated into the audio stream within the Net- 
work Control Center as described above. It may also be 
incorporated into the audio stream at the User level, or at 
some intermediate point in the distribution architecture. 

5 In addition, the side -bar information discussed above can 
also include advertising content. FIG. 5 illustrates the pro- 
vision to the User of two separate streams 32, 34 of packets, 
one of which may be used for advertising. In this case the 
insertion of the stream of commercial advertising into the 

io non-commercial stream occurs on the User's computer. FIG. 
5 also illustrates packet stream 36 which identifies the User 
to the system. This enables the system to monitor which 
Users are listening to which channels and also allows the 
system to vary, for example, the advertising content deliv- 

15 ered to a User. 

One advantage of this alternative is to allow targeted 
commercial delivery based on the individual User. That is, 
an individual User would receive the main audio feed plus 
a particular advertising stream unique to his demographic 
group. Note that the advertising stream typically is lower in 
overall bitrate and generally does not require real-time 
delivery, thus lowering the overall load on the network. For 
example, the advertising stream could be delivered to the 
User in advance of the regular programming, stored in a 
buffer in the User's computer and inserted into the stream of 
regular programming upon receipt of a cueing signal embed- 
ded in the stream of regular programming. Thus, a substan- 
tial number of targeted groups, perhaps 10 or 100 or even 
more could be accommodated without an impractical 
increase in network load. 

II. Control Architecture 

The control architecture described in this section is 
responsible for managing and administering the Users who 
are receiving the information being delivered by the distri- 
bution architecture described in the previous section. The 
control architecture handles new User registration, User 
login, the starting and stopping of audio streams and the 
monitoring of ongoing transmissions. The control architec- 
ture is scalable just as is the distribution architecture so that 
any number of Users can be managed. 

This section describes the control protocol, which consists 
of the format and sequence of control messages that are 
exchanged among Users, Control Servers, Media Servers, 
Primary Servers and the Administration Server. These mes- 
sages are in the form of objects, which have specific data 
formats. Objects are exchanged preferably using the TCP 
protocol although other options are possible. Below we 
describe the sequence of objects passed among the various 
computers and detail the internal structure of each object. 

The major objects used in the present embodiment of the 
invention are set forth in Table 1. For each object, Table 1 
provides a brief description of its function, identification of 
the names of the fields in the object, their types and a brief 
description of their function. 

TABLE 1 



20 



25 



30 



35 



40 



50 



55 



Field Name 



Field Type 



Remarks 



Channel Activation Object 
Contains information used for channel activation/deactivation. It is sect 
to Media and Primary Servers to tell them to carry or stop carrying a 
specific channel. Media Servers get the channel from another server in 
the system hierarchy and Primary Servers get and encode the feed from 
the actual input source. 



65 



Token 
Moniker 



Security Token Object 
Moniker Object 



unique channel identifier 
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TABLE 1 -continued 



TABLE 1-continued 



Field Name 



Field Type 



Remarks 



Activate 

CompressType 

Host 



Ini 
Int 

Host Object 



action flag (activate/ 
deactivate) 

type of compression to 
use 

host carrying the channel 



Channel Guide Object 
Contains analytical and descriptive information for an item requested 
that is uniquely identified by a moniker. It is usually the reply to a 
Channel Guide Request object. 



Token 

Type 

Result 



Security Token Object 
Int 



type of content 

the content data itself 



Channel Guide Request Object 
Conveys a request for analytical and descriptive information about an 
item uniquely identified by the contained moniker. The reply is in the 
form of a Channel Guide object 



Token 

Type 

Moniker 



Security Token Object 
Int 

Moniker Object 



inherited from base class 
type of content 
unique identifier 



Host Object 

Encapsulates the attributes of a networked computer related to the 
operation or services it offers or requests. 



Token 
HostName 

PortNumber 
DisplayName 



Security Token Object 
String 

Int 

String 



computer name and 
domain 

port number for service 
descriptive computer 
name 



Login Information Object 
Encapsulates the name and password by which a User is known to the 
system. 



Token 
Login 

Password 



Security Token Object 
String 

String 



User's system login 
name 

User's system password 
(possibly encrypted) 



Media Control Interface (MCI) Request Object 
Encapsulates a multimedia control command, such as play and stop, and 
any extra information that may be necessary to perform the requested 
service. 



Token 

Command 

String 



Security Token Object 
Int 

String 



multimedia command 
command-specific extra 
info 



Moniker Object 

A moniker encapsulates the name of an object or process with the 
intelligence necessary to work with that name. In other words, it 
provides naming and binding services. The Moniker Object is used in 

the system for unique identification of various components, parts or 
features, such as a channel, a directory, or a computer list. 



Token 
ID 

DisplayName 



Security Token Object 

String 

String 



unique string identifier 
User-readable name 



Ping Object 

Ping is the name given to the "Are-you-Alive?" operation useful in 
determining if a specific computer is up and running. This object is 
used in the system when a server has to be queried for its operation 
status. It can also provide timing information for statistical purposes 
and quality of service evaluations. 



Token 

Date 

Time 



Security Token Object 

Date 

Tunc 



system date 
system time 



45 



Field Name 



Field Type 



Remarks 



Protocol List Object 
Encapsulates a general purpose collection object 



Token 
Type 



Security Token Object 
Int 



type of object list 



10 Result Message Object 

Acts as the acknowledgment for a requested service successfully carried 
that out or reports errors that occur in the system during a client/server 
transaction. 



Token 
15 Code 
Message 



Security Token Object 
Int result code 

String message corresponding 

to code 



20 



Security Token Object 
Contains the authorization key for a transaction. The key must be 
validated before any service is performed. 



ID 



String 



authorization key/ 
transaction ID. 



25 



30 



35 



Server Activation Object 
Contains information used in the server activation/deactivatton process. 
Used for announcement as well as command purposes (e.g., a server can 
notify the administration database that is now activated or a server can 
be instructed to manage someone else). 



Token 


Security Token Object 




Active 


Int 


action flag (activate/ 






deactivate) 


Manage 


Int 


control flag (manage/ 






associate) 


Type 


Int 


server type 


Host 


Host Object 


host to be controlled 




Server List Request Object 


Encapsulates a request for a list of available server resources for an 


identified 


service (e.g., a request for a list of Control Servers for a 




specified channel). 




Token 


Security Token Object 




Type 


Int 


type of service 


Moniker 


Moniker Object 


content/channel unique 






identifier 


Host 


Host Object 


local host information 



Statistics Object 
Contains system-related information that can be used by load- 
balancing algorithms and for statistical purposes. 



Token 


Security Token Object 




Load 


Int 


load on the system 


Threads 


Int 


number of threads 






running 


Users 


Int 


number of Users being 






serviced 


Uptime 


Int 


amount of time running 


NumberManagcd 


Int 


number of managed 


Number Associated 


Int 


servers 






number of associated 






servers 



60 



Statistics Request Object 
Encapsulates a request for system- related information that can be used 
by load- balancing algorithms and statistical purposes. 



Token 
Load 
Threads 
65 Users 
Uptime 



Security Token Object 

Int 

Int 

Int 

Int 



request flag (on/off) 
request flag (on/off) 
request flag (on/off) 
request flag (on/off) 
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TABLE 1 -continued 



Field Name 


Field Type 


Remarks 






Numbe r Managed 


Int 


request flag 


(on/off) 


5 


NumberAssociated 


Int 


request flag 


(on/off) 





User Object 

Users and Servers use this object to register themselves with the 
administration database. They provide the information for subsequent 
logins (name, password) and other system-related info- The end-Users 

provide personal, demographic, and system- related information. 



Token 


Security Token Object 




Login 


Login Information Object login information 






(name, password) 


FirstName 


String 


User's first name 


LastName 


String 


User's last name 


Title 


String 


User's job title 


Company 


String 


User's employer 


Address 1 


String 


User's home street 




address 


Address2 


String 


User's address extra 


City 


String 


city, village 


State 


String 


state, province or 
foreign country 


ZipCode 


String 


zip or postal code 


Age 


String 


User's age 


Gender 


String 


User's gender 


PhoneNumber 


String 


telephone number 


FaxNumber 


String 


fax number 


Email 


String 


email address 


Demographics 


Dictionary 


market targeting extra 
User info 


Systemlnfo 


Dictionary 


system-related 
information 



15 



20 



25 



30 



Version Object 

All components of the system use this object to report their versioning 
information to the party they transact with in order to use a protocol 
they both understand. They are also given the chance to update 
themselves if a newer version exists. 



Token 
Major 

Minor 

Type 
Client 



Security Token Object 
Int 

Int 

Int 

Version 



major protocol version 
number 

minor protocol version 

number 

sender type 

client version 

information 



40 
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sequences. The version sequence starts after the computer 
initiating the transaction, the client, has established a con- 
nection with the computer completing the transaction, the 
server. The client sends a Version Object (defined in Table 1) 
and in response the server then sends back its own Version 
Object. This version sequence is used so that both client and 
server are aware of the version numbers of the software they 
are using. If a version number is older than expected, either 
client or server can choose to conform to the previous 
version or abort the transaction, depending on its needs and 
capabilities. If a version number is newer than expected, in 
most cases the current transaction can be completed since 
the software systems are designed to be fully backward 
compatible with previous versions. Additionally, in the case 
that the server of the transaction is the Administration 
Server, the client receives information about what the latest 
version number is and thus the client can be informed that 
a software update is needed. The process of handling auto- 
matic updating of User software is described more fully 
below. 

After the version sequence, one or more protocol 
sequences occur in which other objects are exchanged 
between client and server. When a particular protocol 
sequence is completed, another independent protocol 
sequence can be serviced. The protocol sequences that are 
part of the control architecture of the present invention are 
summarized in Table 2 and described below in conjunction 
with FIGS. 6-17. 



TABLE 2 



35 



Summary of Protocol Sequences 



Control Sequence Client 



Server 



Main Objects 
Exchanged 



User Registration 
and Login 
(see FIG. 6) 

User Login 
(sec FIG. 7) 



User 



Unlike traditional protocols based on state computers, the 
control protocol of the present invention is a light-weight, 
stateless protocol comprising simple sequences objects. It is 
light-weight in that in most sequences only two objects are 
involved in the transaction and after a sequence is completed 
the connection can be reused. It is also stateless in that the 
server maintains no information about the client. Every 50 
transaction is handled independently of the previous ones. 
States exist in the lower levels, for example within the TCP 
layer, to express logical states of a network connection but 
they are not actually part of the control protocol. 

In the preferred embodiment, the software- running on the 55 
Control Servers, Media Servers and Primary Servers is 
programmed for Windows NT and UNIX environment using 
the OLE environment. In addition, COM interfaces are used 
between components. The Rogue Wave system is used to 
transfer objects between the applications running on the 60 
various computers. The software running on the User com- 
puter is preferably programmed for a Windows 32-bit 
environment, so it will run on a Windows 95 or Windows NT 
computer. Alternatively, Macintosh and UNIX environments 
can be accommodated by other User software. $5 

The basic process of a control transaction consists of a 
version sequence followed by one or more protocol 
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Control 



Media 



Token Validation 
(see FIGS. 9A, 
9B) 
Server 

Registration and 
Login 

(see FIG. 10) 
Server Login 
(see FIG. 11) 



Control Server 
Activation 
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TABLE 2-coniinued 

Summary of Protocol Sequences 



Main Objects 



Control Sequence 


Client 


Server 


Exchanged 


Media Server 


Control 


Media 


Version Object 


Activation 




Server Activation 




(sec FIG. 13) 




Object 


(TCP connection 
stays open) 


Control Channel 


Administration 


Control 


Version Object 


Activation 






Channel 


(see FIG. 14) 






Activation Object 


Media Channel 


Control 


Media 


(open TCP 


Activation 






connection) 


(see FIG. 15) 






Channel Activ- 
ation Objects 


Distribution 


Media 


Media or 


Version Object 


Activation 




Primary 


MCI Objects- 


(see FIG. 36) 






OPEN/PLAY/ 
STOP/CLOSE 
Ping Objects 
(TCP connection 
stays open) 


Statistics Request 


Administration 


Control or 


Version Object 


(sec FIG. 17) 




Media 


Statistics Object 



The User registration and login sequences are the pro- 
cesses by which a new User registers with the system, logs 
in and retrieves programming information. The channel play 
sequence takes place when a User asks to listen to a 
particular channel. The token validation sequence is used to 
verify that a computer requesting a service is authorized to 
do so. The Server registration, login and activation 
sequences are used by Control and Media Servers when they 
become active. The Control Server and Media Server acti- 
vation sequences are used to manage the Control and Media 
Servers. The control channel, media channel and distribution 
activation sequences are used to cause a channel to be 
distributed to a Media Server. Finally, the statistics request 
is used for administrative purposes. 

FIG. 6 illustrates the User registration and login sequence 
in more detail. This sequence takes place after the User has 
installed the User software on his/her computer. It is 
expected that the User will download the software from the 
Internet and then invoke it which in the preferred embodi- 
ment will use the Windows Wizard interface. This will guide 
the User through the installation process including filling out 
the registration form, which we will describe more fully in 
the next section. After the User has selected a name and 
password and selected the option to register, the User 
computer opens a TCP connection to the Administration 
Server. Advantageously, the full domain name of the Admin- 
istration Server is embedded into the User software, 
although it could be discovered in other ways. The User and 
Administration Server then exchange version objects with 
the Administration Server as described above. If the version 
numbers meet expectations, the User sends a User Object to 
the Administration Server. The format of the User Object is 
shown in Table 1. Once the Administration Server receives 
the User Object, it verifies that the information is filled in 
properly and that the selected User name is unique. If the 
User Object is invalid for any reason, the Administration 
Server returns a Result Message Object with a code indi- 
cating the reason. The format of the Result Message Object 
is shown in Tabic 1. If the User information is valid, the 
Administration Server updates the global database of User 
names and passwords and then generates a security token for 
that User. This security token is then returned to the User in 
a Result Message Object. 
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Upon receiving the Result Message Object, the User 
saves the security token for future use. This token is an 
identifier that allows the User to request services from the 
Administration Server and other computers within the over- 

5 all system. The security token is not saved permanently or 
registered on the User computer. Normally, the User soft- 
ware then immediately sends a Channel Guide Request 
Object to the Administration Server and a Channel Guide 
Object is returned. 

10 The format of these objects is also shown in Table 1. Note 
that in principle, this is a separate transaction and could take 
place in a separate TCP connection to the Administration 
Server. In particular, once the User has registered and logged 
in, he/she can request the Channel Guide Object again since 
it may have been updated since the previous request. At this 

15 point the TCP connection to the Administration server is 
closed. 

The process of User registration only needs to take place 
once for each User. However, anyone can re-register at any 
time, even after the software has been installed. In particular, 

20 it is expected that if multiple persons use a computer, each 
person will register and obtain his/her own User name and 
password. If the registration process is not completed 
successfully, the User software saves the registration infor- 
mation and asks the User if they would like to try again the 

25 next time the software is invoked. 

Since the security token is not permanently saved by the 
User software, it is lost when the User software is closed, 
and the security token must again be retrieved from the 
Administration Server the next time the User wants to use 

30 the system. This process is the purpose of the login sequence 
illustrated in FIG. 7. This sequence is used if a User has 
already registered and needs only to retrieve a valid security 
token. In this case the sequence consists of the User's 
sending a Login Information Object to the Administration 

35 Server. The Administration Server then queries the User 
database to validate the login name and password. If the 
login name and password are correct, then a security token 
is returned to the User. Normally the receipt of the security 
token will immediately be followed by a channel informa- 

40 tion request sequence, just as in the registration sequence 
described previously. 

The control sequence that takes place when a User 
initiates a channel play operation is illustrated in FIGS. 8A, 
8B and 8C. First the User software requests a Control Server 

45 List from the Administration Server. Note that the Server 
List Request Object, illustrated in Table 1 contains a channel 
identifier. The Administration Server generates a sorted list 
of Control Servers based on overall system load and the 
location of the User on the network and returns this list to the 

50 User using a Protocol List Object. Once the Control Server 
List is returned to the User, the Administration Server is no 
longer needed and the TCP connection is closed. 

The User software then searches the list of Control 
Servers and opens a TCP connection to the first host listed. 

55 If that host computer does not respond, then the next Control 
Server on the list is tested and so forth in succession. Upon 
obtaining a response from a Control Server, the User soft- 
ware uses a Server List Request Object to request a Media 
Server List from the Control Server. If the Control Server is 

60 too busy to service the User, it returns a Result Message 
Object so indicating and the User software tries the next 
Control Server on the list. However, in the likely scenario 
that the Control Server is able to handle the User's request, 
a sorted list of Media Servers is generated and returned to 

65 the User computer using a Protocol List Object. The TCP 
connection to the Control Server is then closed by the User 
software. 
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At this point the User software initiates a TCP connection If a Media Server or Control Server that has sent a Server 

to the first Media Server on the list provided by the Control Activation Object to the Administration Server becomes 

Server. As in the previous case, it attempts to connect to the inactive, it will send another Server Activation Object indi- 

first host on the list and if unsuccessful tries the next hosts eating this condition. In the case of a Media Server, this 

in succession. Once the Version Objects are exchanged, the 5 object is sent to the managing Control Server. In the case of 

User software sends an MCI Request Object to the Media a Control Server, this object sent to the Administration 

Server. An MCI Request Object can be used for four basic Server. As in the case of User registration, Media Server and 

commands: OPEN, PLAY, STOP and CLOSE. The User Control Server registration needs only take place once per 

software must first send an OPEN command for the desired computer. However, if the computer is restarted, the server 

channel. If the returned Result Message Object indicates 10 must login and again retrieve a security token. This is the 

success, the User software then sends a PLAY command. server login and activation sequence shown in FIG. 11. 

When the Media Server receives a valid PLAY command, Once a Control Server has indicated to the Administration 

it initiates the delivery of audio information to the User as Server that it is ready, the Administration Server can activate 

described in the previous section. Note that this could be in that Control Server by sending the Control Server a Server 

the form of broadcast, multicast or unicast packets to a 15 Activation Object as illustrated in FIG. 12. This is a separate 

specific UDP port. The TCP connection through which the transaction and is used to tell the Control Server which 

MCI Request Objects were sent stays open during the audio Media Servers it is supposed to manage. Recall that a 

play operation. In addition, Ping Objects are sent to the User Control Server and a number of Media Servers form a 

on a periodic basis to verify that the computer is still cluster of Media Servers. The single Control Server thai 

working and active. When the User software receives a Ping 20 manages that cluster must be given a list -of host computers 

Object, it simply returns it. The Media Server uses the Ping corresponding to the Media Servers in that cluster. 

Objects to measure round trip time and also to determine The p roce ss by which a Control Server activates the 

when a User's computer has terminated abnormally. In that Media Servers that it manages is illustrated in FIG. 13. The 

case the audio stream is terminated. Control Server sends a Server Activation Object to the 

In the case of normal termination of the audio stream, the 25 Media Server indicating that it is responsible for channel 

User makes an explicit selection to stop and this causes a management. This TCP connection between the Control 

STOP command to be sent to the Media Server in an MCI Server and the Media Server stays open during the time that 

Request Object. The Media Server then terminates the audio both servers are active. The Control Server periodically 

stream to that User. When the User closes the application sends Ping Objects to the Media Server across this open TCP 

software or selects another channel to play, the User soft- 30 connection to verify that the Media Server is still running, 

ware will send a CLOSE command to the Media Server in FIG. 14 illustrates the process by which a given channel 

an MCI Request Object and the TCP connection is closed. is activated by the Administration Server. The Administra- 

The initiation of the audio stream by the Media Server tion Server opens a connection to a Control Server that its 

causes a log entry to be generated and sent to the Admin- 35 wishes to have carry a given channel and provide a Channel 

istration Server. This information is important so that the Activation Object. This object indicates to the Control 

Administration Server can update its database to indicate Server which Media or Primary Server the Control Server 

which Users are listening to which channels. The security should direct its Media Servers to get the feed from. At this 

token is used to identify the User initiating the audio stream. point the Control Server is said to be carrying that channel 

Additionally, when the audio stream is terminated to any ^ and it will be a valid host on a list of Control Servers 

User, another log message is generated and sent to the requested by a Channel Play sequence. 

Administration Server. FIG. 15 illustrates what happens when a Control Server 

FIG. 9A illustrates the process by which security tokens needs to provide a channel. First it sends a Channel Acti- 

are validated. The Administration Server is the only server vation Object to one of the Media Servers that it manages 

that can validate a security token. Thus, when a User 45 across the open TCP connection described previously. This 

requests services from a Control Server or from a Media object indicates to the Media Server that it should start 

Server, that server must go back to the Administration Server receiving the channel identified and from where it should 

with a token validation sequence. However, Control Servers receive it. 

and Media Servers are allowed to cache validations of In FIGS. 16A and 16B depict how the Media Server 

security tokens so that they do not have to validate tokens 50 requests distribution of an audio channel from another 

repeatedly once they have validated it the first time. In the Media Server or from a Primary Server. This sequence is 

case where a Media Server receives a request, the token will much the same as that in which a User requests the distri- 

be validated with the Control Server that is managing that bution of audio information from a Media Server. Note that 

Media Server. FIG. 9B identifies the various token valida- a Media Server receives a single incoming stream for each 

tion scenarios. 55 channel that it is carrying and will then redistributes this 

FIG. 10 illustrates the process by which a new Server is stream to all Users or other Media Servers that request it. 

registered. This process is similar to new User registration. Finally, FIG. 17 illustrates the statistics request sequence. 

It is expected, however, that the server installation will be This sequence is used by the Administration Server to gather 

through a Web interface rather than a Wizard. The Admin- information from the Media Servers and Control Servers in 

istration Server, upon receiving a User Object from a Media 60 order to manage the overall system. It can use this infor- 

Server or Control Server validates the User name and mation to detect failures and to balance load as the dynamic 

password and generate a security token just as in the case of conditions change. As indicated above, it can also use this 

User registration. Normally the Server then immediately information to monitor which Users are listening to which 

sends back a Server Activation Object indicating that it is channel or whether Users stop listening to a channel at any 

ready to be used as a system resource. Once this process has 65 time, such as during the play of a particular song. It can also 

been completed, the TCP connection to the Administration use this information to control the advertising content that is 

Server is closed. downloaded to a particular User in advance of receipt of 



